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IN THE CLAIMS 

This listing of claims will replace all prior versions, and listings, of claims in the 
application: 




Claim 1 (Currently Amended): A document classification system for classifying a 
rent based on contents of the document of which contents contains a plurality of items, 
said documennclassification system comprising: 

inputtinjWeans for inputting document data corresponding to the document data; 
displavingyieans for displaying the items of the document data input by said 
inputting means; 

designating mdkns for designating at least one of the items contained in th e docum e nt 
input by said inputting rm e ans displayed in said displaying means ; 

converting means for converting the document data into converted data so that the 
converted data contains only d^ta corresponding to the item designated by said designating 
means; and 

classifying means for classli|ying the document by using the converted data produced 
by said converting means. 

Claim 2 (Original): The document classification system as claimed in claim 1, 
wherein said classifying means includes document vector producing means for producing a 
feature vector representing a feature of the converted data so as to classify the document in/ 
accordance with the feature vector produced by s^id document vector producing means. 

Claim 3 (Original): The document classificati&n system as claimed in claim 1, 
wherein said converting means includes separation sign mserting means for inserting a 
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predetermined sign between sets of data corresponding to the items so as to facilitate 
separation of each data corresponding to each item in the converted data. 

Claims 4-6 (Canceled). 

Claim 7 (Currently Amended): A processor readable medium storing program code 
causing a computer to classify a document based on contents of the document of which 
contents contains a plurality of items, comprising: 

first program code means For inputting document data corresponding to the document 
data, \ 

second program code means Vor displaying the items of the document data input by 
said first program code means; \ 

s e cond third program code means for designating at least one of the items contain e d 
in the document displayed by the secondWogram code means ; 

third fourth program code means for converting the document data into converted data 
so that the converted data contains only data corresponding to the item designated by the 
s e cond third program code means; and \ 

fourth fifth program code means for classifying the document by using the converted 
data produced by the third fourth program code means. 

Claim 8 (Currently Amended): The processo\ readable medium as claimed in claim 
7, wherein the fourth fifth program code means includes fifth sixth program code means for 
producing a feature vector representing a feature of the converted data so as to classify the 
document in accordance with the feature vector. \ 
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Claim 9 (Currently Amended): The processor readable medium as claimed in claim 
7, wherein the third fourth program code means includes sixth seventh program code means 
for inserting a predetermined sign between sets of data corresponding to the items so as to 
facilitate separation of each dam corresponding to each item in the converted data. 



Claim 10 (Currently Amended): A document classification system for classifying a 
document according to contents of the document, said document classification system 
comprising: 

input means for inputting document data of the document; 

analyzing means for analyzing the document data so as to obtain analysis information; 
vector producing means for producing a document feature vector with respect to the 
document data based on the analysis information; 



transforming function calculating means for calculating a representation transforming 
function used for projecting the document feature vector onto a space in which similarity 
between the document feature vectors is reflected with a dimensional number different from a 
dimensional number of the document feature vector ; 

~ u~ 

vector transforming means for transforming the document feature vector by using the 
representation transforming function; 

classification means for classifying the document based on similarity between the 
document feature vectors transformed by the vector transforming means; and 

classification result storing means for storing^ a result of classification performed by 
the classification means. 

Claim 1 1 (Original): The document classificatiork system as claimed in claim 10, 
further comprising inner product calculating means for calculating an inner product between 
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\ 

the document feature vectors, wherein 'said representation transforming function calculating 
means calculates the representation transforming function by using the inner product. 



Claim 12 (Currently Amended): Th e docum e nt classification syst e m as claim e d in 
claim 1 1 , furth e r comprising A document ^classification system for classifying a document 
according to contents of the document, saill document classification system comprising: 

input means for inputting documenldata of the document; 

analyzing means for analyzing the dlcument data so as to obtain analysis information; 

vector producing means for producing a document feature vector with respect to the 
document data based on the analysis information; 

transforming function calculating means for calculating a representation transforming 
function used for projecting the document feature vector onto a space in which similarity 
between the document feature vectors is reflected; 

vector transforming means for transforming the document feature vector by using the 
representation transforming function; 

classification means for classifying the document based on similarity between the 
document feature vectors transformed by the vector {transforming means; 

classification result storing means for storing a^result of classification performed by 
the classification means; 

inner product calculating means for calculating an\inner product between the 
document feature vectors, wherein said representation transforming function calculating 
means calculates the representation transforming function by Vising the inner product; and 

document similarity information setting means for setting document similarity setting 
information including data representing an author of the document and a date of production 
of the document, wherein said representation transforming function calculating means 
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calculates the representation transforming function by using the inner product and the 
document similarity information. 

Claim 13 (Original): The doc\iment classification system as claimed in claim 10, 
further comprising: 

vector storing means for storin'g the document feature vector produced by said vector 
producing means; and 

transforming function storing means for storing the representation transforming 
function calculated by said representation transforming function calculating means. 

Claim 14 (Currently Amended): ihe docum e nt classification system as claimed in 
claim 1 0, furth e r comprising A document classification system for classifying a document 
according to contents of the document said pocument classification system comprising: 

input means for inputting document data of the document; 

analyzing means for analyzing the document data so as to obtain analysis information; 

vector producing means for producing avdocument feature vector with respect to the 
document data based on the analysis informatior 

transforming function calculating means fbr calculating a representation transforming 
function used for projecting the document feature Rector onto a space in which similarity 
between the document feature vectors is reflected; 

vector transforming means for transforming th^ document feature vector by using the 
representation transforming function; 

classification means for classifying the document based on similarity between the 
document feature vectors transformed by the vector transforming means; 
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classification result storing means for storing a result of classification performed by 



the classification means; and 

vector correcting means for correcting the document feature vector before the 
document feature vector is transformed by said vector transforming means, a correction being 
performed by processing one of the document feature vector and a feature dimension 
constituting the document feature vecior in accordance with a rule established by 
characteristics of words extracted by said analyzing means. 



Claim 15 (Original): The document classification system as claimed in claim 14, 
further comprising transforming function correcting means for correcting the representation 
transforming function calculated by said transforming function calculating means when the 
feature dimension is changed due to a correction of the document feature vector by said 
vector correcting means so that the document feature vector is transformed by said vector 
transforming means in accordance with tie changed feature dimension. 



Claim 16 (Currently Amended): Tn e docum e nt classification syst e m as claim e d in 
claim 10, further comprising: A document Classification system for classifying a document 
according to contents of the document, said document classification system comprising: 

input means for inputting document data of the document: 

analyzing means for analyzing the document data so as to obtain analysis information; 

vector producing means for producing a aocument feature vector with respect to the 
document data based on the analysis information; 

transforming function calculating means for Calculating a representation transforming 
function used for projecting the document feature veqtor onto a space in which similarity 
between the document feature vectors is reflected; 



7 



Application No. 09/288,856 
Reply to Office Action of March 26, 2003 

vector transforming means for transforming the document feature vector by using the 
representation transforming function; 

classification means for classifying the document based on similarity between the 
document feature vectors transformed by.the vector transforming means; 

classification-result storing means for storing a result of classification performed by 



the classification means; 

transforming function correction instructing means for sending an instruction 
regarding a process to be applied on a fqature dimension of the representation transforming 
function; and 

means for correcting the representation transforming 
function based on a content of the instruction sent from said transforming function correction 
instructing means. 



Claim 17 (Original): The document classification system as claimed in claim 16, 
wherein the process indicated in the content of the instruction is performed by using data of 
an arbitrary document vector. 

Claim 18 (Original): The document classification system as claimed in claim 16, 
wherein the process indicated in the content of the instruction is performed by using the 
document feature vectors. 



Claim 19 (Original): The document classification system as claimed in claim 16, 
wherein the process indicated in the content oi the instruction is performed by using the 
analysis information obtained by said analyzing means. 
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Claim 20 (Original): The document classification system as claimed in claim 16, 



wherein the process indicated in the content of the instruction is performed by using the result 
of classification stored in said classification-result storing means. 



Th e docum e nt classification syst e m as claim e d in 



Claim 21 (Currently Amended): 
claim 10, furth e r comprising: A document ^classification system for classifying a document 



according to contents of the document, s 



\ 



aid document classification system comprising: 



input means for inputting docum< mt data of the 



document; 

analyzing means for analyzing 



the 



document data so as to obtain analysis information; 



vector producing means for produping a document feature vector with respect to the 
document data based on the analysis information; 

transforming function calculating means for calculating a representation transforming 
function used for projecting the document feature vector onto a space in which similarity 
between the document feature vectors is reflected; 

vector transforming means for transforming the document feature vector by using the 
representation transforming function; 

classification means for classifying th& document based on similarity between the 
document feature vectors transformed by the vector transforming means; 

classification result storing means for storing a result of classification performed by 
the classification means; 

an initial cluster centroid designating mean^ for designating an initial cluster centroid; 

and 

initial cluster centroid registering means for registering the initial cluster centroid 
designated by said initial cluster centroid designating means, 
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wherein said classification mlans classifies the document in accordance with the 

initial cluster centroid registered by said initial cluster centroid registering means. 

Claim 22 (Original): The document classification system as claimed in claim 21, 



wherein the initial cluster centroid de: 



ignated by said initial cluster centroid designating 



means is arbitrary document vector data. 

Claim 23 (Original): The document classification system as claimed in claim 21, 
wherein the initial cluster centroid designated by said initial cluster centroid designating 
means is the document feature vector. 

Claim 24 (Original): The document classification system as claimed in claim 21, 
wherein the initial cluster centroid designated by said initial cluster centroid designating 
means is the analysis information obtained by said analyzing means. 

Claim 25 (Original): The document alassification system as claimed in claim 21, 
wherein the initial cluster centroid designated \>y said initial cluster centroid designating 
means is the result of classification stored by said classification-result storing means. 

Claims 26-41 (Canceled). 



Claim 42 (Currently Amended): A processor readable medium storing program code 
causing a computer to classify a document according to Contents of the document, 
comprising: 

first program code means for inputting document fiata of the document; 
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second program code means for analyzing the document data so as to obtain analysis 

information; 

third program code means for producing a document feature vector with respect to the 
document data based on the analysis information; 

fourth program code means for calculating a representation transforming function 
used for projecting the document feature vector onto a space in which similarity between the 
document feature vectors is reflected with a dimensional number different from a 



dimensional number of the document feature vector ; 



fifth program code means for transfo rming the document feature vector by using the 
representation transforming function; 

sixth program code means for classifying the document based on similarity between 
the document feature vectors transformed by the fifth program code means; and 

seventh program code means for storing a result of classification performed by the 
classification means. 



Claim 43 (Original): The processor readable medium as claimed in claim 42, further 
comprising eighth program code means for calculating an inner product between the 
document feature vectors, wherein the representation transforming function is calculated by 
using the inner product. 

Claim 44 (Currently Amended): Th e proc e ssor r e adabl e medium as claimed in claim 
4 3, furth e r comprising A processor readable mediunA storing program code causing a 
computer to classify a document according to contents of the document, comprising: 



first program code means for inputting document data of the document; 
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second program code means for analyzing the document data so as to obtain analysis 

information; 

third program code means for producing a document feature vector with respect to the 
document data based on the analysis information; 

fourth program code means for calculating a representation transforming function 
used for projecting the document feature vectoy onto a space in which similarity between the 
document feature vectors is reflected; 

fifth program code means for transforming the document feature vector by using the 
representation transforming function; 

sixth program code means for classifying the document based on similarity between 
the document feature vectors transformed by the fifth program code means; 

seventh program code means for/storing a result of classification performed by the 
classification means; 

eighth program code means for/calculating an inner product between the document 
feature vectors, wherein the representation transforming function is calculated by using the 
inner product; and 

ninth program code means fof setting document similarity setting information 
including data representing an author of the document and a date of production of the 
document, wherein the representation transforming function is calculated by using the inner 
product and the document similarity information. 

Claim 45 (Original): The processor readable medium as claimed in claim 42, further 
comprising: 

tenth program code means f<[>r storing the document feature vector produced by the 
third program code means; and 
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eleventh program code means for storing the representatioryffansforming function 

calculated by the fourth program code means. / 

Claim 46 (Currently Amended): Th e proc e ssor readabl e m e dium as claim e d in claim 
42, furth e r comprising A processor readable medium storing program code causing a 
computer to classify a document according to contents of me document, comprising: 

first program code means for inputting document/data of the document; 

second program code means for analyzing the document data so as to obtain analysis 
information; / 

third program code means for producing a dodument feature vector with respect to the 
document data based on the analysis information; / 

fourth program code means for calculating a[ representation transforming function 
used for projecting the document feature vector ontfo a space in which similarity between the 
document feature vectors is reflected; / 

fifth program code means for transforming the document feature vector by using the 
representation transforming function; / 

sixth program code means for classifying the document based on similarity between 
the document feature vectors transformed by the fifth program code means; 

seventh program code means for storing a result of classification performed by the 
classification means; and 

twelfth eighth program code means for correcting the document feature vector before 
the document feature vector is transformed by the fifth program code means, a correction 
being performed by processing one of the document feature vector and a feature dimension 
constituting the document feature vector in accordance with a rule established by 
characteristics of words extracted by the second program code means. 
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Claim 47 (Currently Amended): The processor readable medium as claimed in claim 
46, further comprising thirt ee nth ninth program cod^means for correcting the representation 
transforming function calculated by the fourth program code means when the feature 



dimension is changed due to a correction of the document feature vector by the tw e lfth eighth 
program code means so that the document featur^ector is transformed by the fifth program 
code means in accordance with the changed feature dimension. 

Claim 48 (Currently Amended): Th e proc e ssor r e adabl e m e dium as claim e d in claim 
42, furth e r comprising: A processor readable medium storing program code causing a 
computer to classify a document according to contents of the document, comprising: 

first program code means for inputting document data of the document; 

second program code means for analyzing the document data so as to obtain analysis 
information; I 

third program code means for producing a document feature vector with respect to the 
document data based on the analysis information; 

fourth program code means for calculating a representation transforming function 
used for projecting the document feature vector jpnto a space in which similarity between the 
document feature vectors is reflected; \ 

fifth program code means for transforming the document feature vector by using the 
representation transforming function; \ 

sixth program code means for classifying the document based on similarity between 
the document feature vectors transformed by the fiftA program code means; 

seventh program code means for storing a result of classification performed by the 
classification means; \ 
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t 



fourteenth eighth program code means for sending an instruction regarding a process 



to be applied on a feature dimension of the representation transforming function; and 

fift ee nth ninth program code means for ^orrecting the representation transforming 
function based on a content of the instruction sent by the fourt ee nth eighth program code 
means. 



Claim 49 (Currently Amended): Th e [proc e ssor r e adabl e m e dium as claim e d in claim 
4 2, further comprising: A processor readable medium storing program code causing a 



computer to classify a document according tj contents of the document, comprising: 



first program code means for inputti lg document data of the document; 



second program code means for ana yzing the document data so as to obtain analysis 



information; 

third program code means for producing a document feature vector with respect to the 



document data based on the analysis information 



fourth program code means for calculating a representation transforming function 



used for projecting the document feature vector onto a space in which similarity between the 



document feature vectors is reflected; 

fifth program code means for transforming the document feature vector by using the 



representation transforming function; 

sixth program code means for classifying the document based on similarity between 



the document feature vectors transformed b 



seventh program code means for stoi 



the fifth program code means; 



ing a result of classification performed by the 



classification means; 

sixte e nth eighth program code means for designating an initial cluster centroid; and 
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s e v e nt ee nth ninth program code means for registering the initial cluster centroid 
designated by the sixt e enth eighth progijam code means, 

wherein the document is classified in accordance with the initial cluster centroid 
registered by the seventeenth ninth program code means. 



Claim 50 (New): A document 
on contents of the document of which 



classification system for classifying a document based 
contents contains a plurality of item s wh ic h are ' 



irrespective of chapters, clauses, sentences and paragraphs of the document, said document 
classification system comprising: 

inputting means for inputting document data corresponding to the document; 
displaying means for displaying the items of the document data input by said 
inputting means; 

designating means for designating at least one of the items displayed by said 
displaying means; 

converting means for converting the document data into converted data so that the 
converted data contains only data corresponding to the item designated by said designating 
means; and 

classifying means for classifying the document by using the converted data produced 
by said converting means. 
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